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Abstract 


In  recent  years  there  has  been  an  increased  emphasis  on  the 
forecasting  of  earnings.   This  increased  emphasis  is  largely  a  result 
of  a  widespread  recognition  that  future  earnings  is  an  important  factor 
in  investor  decision  making.   This  is  evidenced  by  the  fact  that  the 
Financial  Accounting  Standards  Board  has  cited  future  earnings  as  the 
single  most  important  variable  in  determining  a  stock's  value,  and 
the  Securities  and  Exchange  Commission  has  recently  considered 
requiring  earnings  forecasts  in  external  reports. 

The  increased  emphasis  on  future  earnings  has  led  to  a  complementary 
increased  emphasis  on  methods  of  predicting  future  earnings.   In 
particular  considerable  attention  has  been  given  to  statistical 
methodology  with  respect  to  predicting  future  earnings.   The  reason  for 
this  is  that  the  accuracy  of  forecasts  is  largely  dependent  on  the 
forecast  methodology  employed;  and  in  particular,  if  a  statistical 
methodology  leads  to  misspecified  or  suboptimal  forecasts  then  dysfunctional 
or  suboptimal  decision  making  can  result  from  the  use  of  such  forecasts. 

Tne   present  study  classified  and  evaluated  the  statistical  forecast 
methodologies  that  have  been  used  in  accounting.   It  was  demonstrated 
that  these  methodologies  employ  statistical  models  which,  due  to 
inherent  limitations,  ignore  data  that  might  have  potential  for  improved 
forecasts.   The  purpose  of  the  study  was  to  introduce  and  employ  a 
multivariate  generalization  of  the  univariate  time  series  methodology 
developed  by  Box  and  Jenkins,  which  overcomes  certain  limitations  of 
previously  used  statistical  methodologies.   Specifically  this 
methodology  was  generalized  to  include  an  additional  predictor  variable 
(in  addition  to  earnings  itself).   The  additional  variable  then  was 
used  to  test  the  predictive  value  of  various  ratio,  market  and  industry 
data. 

These  data  were  reduced  to  four  composite  indices  through  the 
use  of  factor  analysis.   The  four  independent  indices  then  were  tested 
individually  to  determine  their  effect  on  the  predictability  of  the 
earnings  forecast.   The  results  indicated  that  these  indices  did 
not  produce  significant  improvement  in  forecasts. 


ON  THE  VALUE  OF  GENERALIZING  THE  BOX- JENKINS  METHOD 
TO  MORE  THAN  ONE  VARIABLE  FOR  THE  FORECASTING  OF  EARNINGS 

William  S.  Hopwood* 


Overview  and  Purpose 

In  recent  years  there  has  been  ac  increased  emphasis  on  the  fore- 
casting of  earnings.  This  increased  emph-asis  is  largely  a  result  of  a 
widespread  recognition  that  future  earnings  is  an  important  factor  in 
investor  decision  making.   This  is  evidenced  by  the  fact  that  the  Finan- 
cial Accountancy  Standards  Board  (FASB)  has  made  the  importance  of  future 
earning  power  a  primary  consideration  in  the  theoretical  framework  under- 
lying their  recently  proposed  Objectives  of  Financial  Reporting  and 
Elements  of  Financial  Statements  of  Business  Enterprises  (Financial  Ac- 
counting Standards  Board,  1977).   In  addition  the  Securities  and  Exchange 
Commission  (SEC)  recently  has  been  considering  requiring  earnings  fore- 
casts in  external  reports  (The  Wall  Street  Journal,  1978). 

The  increased  emphasis  on  future  earnings  also  has  led  to  an  in- 
creased emphasis  on  methods  of  predicting  future  earnings.   In  particular, 
considerable  attention  has  been  given  to  statistical  methods  that  result 
in  a  predicted  future  earnings  (sonie  examples  of  studies  in   this  area  are 
presented  in  Table  1).   llie  reason  for  this  is  that  the  accuracy  of 
forecasts  is  largely  dependent  on  the  forecast  method  employed;  and  in 
particular,  if  a  statistical  method  produces  misspecifled  or  suboptimal 
forecasts  then  dysfunctional  or  suboptiical  decision  making  can  result 
from  the  use  of  such  forecasts. 
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Table  1 


Selected  Studies  Relating  to  the  Statistical 
Prediction  of  Future  Earnings 


(1)  Studies  dealing  with  the  predictability  or  Information  content  of 
accounting  and  nonaccountlng  numbers.   A  large  number  of  studies 
have  been  conducted  in  this  area.   These  include  studies  on  the 
prediction  of  business  failure  [Beaver,  1966;  Altaian,  iS68;  Beaver, 
1968;  Deaken,  1972],  the  prediction  of  market  return  from  ratios 
[Gonedes,  1973]  and  the  times  series  properties  of  accounting  num- 
bers [Beaver,  1970;  Bali  and  Watts,  1972;  Kennally,  1972;  Lookabili, 
1976].   One  reason  for  the  emphasis  on  the  predictive  properties 

of  accounting  numbers  is  that  predictability  can  be  used  as  a 
surrogate  for  usefulness  [Beaver,  Kennally  and  Voss,  1968]  which 
has  been  cited  as  the  primary  objective  of  accounting  data  [Report 
on  the  Study  Group  on  the  Objectives  of  Financial  Statements,  1973; 
American  Accounting  Association,  1966]. 

(2)  Studies  examining  the  forecast  success  of  rnanagers  relative  to 
statistical  models  [Green  and  Segall,  1966;  Cragg  and  Malkiel, 
1968;  Steckler,  1968;  Mincer  and  Zarnovutz,  1969;  Copeland  and 
Marlonl,  1972;  Siovic,  1972;  Lorek  et  al. ,  1976;  and  Brown  and 
Rozeff,  1977], 


The  present  study  deals  with  the  fact  (which  is  demonstrated  below) 
that  the  Box-Jenkins  method  that  previously  has  been  used  in  accounting 
research  has  inherent  limitations  with  respect  to  the  utilization  of  data. 
Specifically  5  it  is  limited  because  it  ignores  data  that:  might  have  the 
potential  to  iciprove  forecasts. 

The  purpose  of  the  study  is  to  explore  the  lisnitation  by  employing 
a  more  general  approach  to  statistical  forecasting  which  incorporates 
additional  data  into  the  forecast  model.   This  approach  will  overcome 
certain  of  the  limitations  of  the  Box-Jenkins  method. 

The  research  method  used  involves  the  use  of  a  transfer  function 
method  developed  by  Box  and  Jenkins  (1970).   This  approach  incorporates 
an  additional  predictor  variable  in  addition  to  past  earnings  in  the 
form  of  an  index  constructed  from  ratio,  industry  and  market  data  via 
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factor  analysis.  The  null  hypothesis  is  that  this  index  will  not  pro- 
duce Improved  forecasts  when  Incorporated  in  the  basic  model. 

The  central  contribution  of  the  study  is  that  information  is  pro- 
vided with  respect  to  the  value  of  adding  data  to  previously  employed 
models.   For  investnent  decisions,  this  information  is  relevant  because 
it  contributes  to  providing  a  basis  for  economic  valuation  of  the  bene- 
fits of  ratio,  industry  and  market  data.   The  need  for  such  valuation 
arises  from  the  fact  that  data  is  generally  obtained  at  a  cost.  Rational 
decisions  therefore  with  respect  to  its  purchase  can  only  be  oiade  rela- 
tive to  its  benefits,  one  of  which  is  its  ability  to  Improve  forecasting. 

The  Traditional  Box-Jenkins  Approach  and  Its  LiiKltations 

In  recent  years  the  Box-Jenkins  method  for  identification  and  esti- 
mation of  univariate  stochastic  time  series  model  has  been  introduced 
in  the  accounting  literature  (Mabert  and  Radcliffe,  i97A).   One  important 
aspect  of  this  method  is  that  it  encompasses  a  very  broad  family  of 
structural  models  (i.e.,  the  ARIMA  models)  that  describe  a  one  variable 
time  series.   In  addition,  the  method  contains  an  identification  process 
that  generally  selects  the  best  model  from  the  ARIMA  class.   The  net 
result  is  the  selection  of  a  model  that  is  parsimonious  in  that  it  con- 
tains a  small  number  of  parameters. 

Research  has  demonstrated  that  forecasts  from  Box-JerJcins  (BJ)  models 
are  more  accurate  than  other  types  of  forecasts.   For  example,  Foster 
(1977)  found  that  they  compared  favorably  with  several  naive  models  and 
Lorek  et  al.  (1976)  recently  applied  this  paradigm  to  accounting  earnings 
series  and  found  that  an  ARE-IA  model  generally  outperformed  the  published 
earnings  forecasts  of  management. 
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Noteable  however  is  that  ARIMA  models  are  univariate  and  have  no 
capability  of  considering  events  that  occur  in  series  other  than  earnings. 
For  example,  if  Industry  performance  vas  a  good  leading  indicator  of  the 
performance  for  a  particular  firm  of  interest,  the  use  of  the  univariate 
Box-Jenkins  method  in  itself  would  preclude  any  statistical  consideration 
of  such  information.  Ignoring  such  information  could,  theoretically, 
lead  to  a  very  poor  forecast  using  an  ARBIA  model.   This  could  be  avoided 
using  a  less  restrictive  forecast  model.  For  example,  using  the  above 
hypothetical  firm  it  might  be  the  case  that  a  severe  drop  in  industry 
performance  would  go  unnoticed  using  the  Box- Jenkins  method.  This  can 
be  seen  by  observing  Table  2  below. 

Table  2 
-  Demonstration  of  the  Limitations  of  ARIMA  Models 


Time  (t) 


t-4     t-3     t-2     t-1      t      t+1 


EPS  for 

hypothetical  .42      .43      .41      ,44      -45     -.20 

firm „. 

industry 

performance  94      92      10      97      98      95 


index 


Assume  that  EPS  for  time  t+1  is  beitig  forecasted  from  time  t.  Note  that 
a  forecast  model  that  depended  upon  the  last  few  values  of  the  EPS  series 
would  forecast  a  value  of  t+1  somewhere  in  the  neighborhood  of  .40  to  .45. 
Further,  note  that  the  deviation  between  the  actual  EPS  (-.20)  and  the 
forecasted  EPS  would  be  very  large. 
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1 
This  problem  could  have  been  mitigated  by  using  information  in 

the  industry  index.  To  demonstrate  this,  assume  that  there  is  a  three 

quarter  delay  between  shocks  in  the  industry  index  and  reactions  in  EPS. 

Using  a  less  restrictive  paradigm  we  could  have  anticipated  a  sharp  drop 

in  EPS  at  time  t-2  when  the  industry  index  suddenly  dropped  to  10. 

The  above  example  is  rather  drajnatic,  but  limited,  in  that  there 

are  other  types  of  events  in  data  internal  and  external  to  the  firm 

Chenceforth  called  internal-external  data  sets)  that  might  prove  useful 

In  forecasting  such  as  trends,  cycles,  etc. 

A  Generalization  of  the  Traditional  Box-Jenkins  Approach 

A  generalization  of  the  ARIMA  model  is  the  transfer  function  (TF) 
which  has  not  been  generally  used  but  has  recently  been  suggested  by 
Foster  (1977).  This  forecast  method,  which  generalizes  the  traditional 
Box-Jenkins  approach,  avoids  the  above  limitations.  In  particular,  it 
generalizes  the  ARIMA  models  by  allowing  for  the  simultaneous  modeling 
of  the  time  series  properties  of  more  than  one  scries  of  interest.   The 

general  form  of  the  transfer  function  is  (1)  y  =  [f , (y  ,,  y  o»>**)> 

.  ,   (1)  ^   (1)     s      .    ,      (2)  ,.   (2)  ,   (n) 

2^  t   »   t~l   '  "•^'  "^3^  t   '  ^t-1   '  °**^'  '**'  n^  t   ' 

X  ^^  ■',  ...)  -S-  u-  v].   Note  that  (1)  completely  generalizes  the  ARIMA 
models  to  remove  the  above  discussed  limitations.  In  particular,  f_, 
f-,  ...,  f  produce  a  generalization  by  allowing  y  to  be  modeled  as 
a  function  ofx,  x,  ...,  x   .   The  net  result  is  a  very  broad 
family  of  models  which  contain  the  KKWA   models  as  a  proper  subset.   In 
summary,  the  transfer  function,  due  to  its  generality,  has  the  ability 
to  utilize  more  data  than  the  ARIK^i.  models.   Specifically,  it  can  simul- 
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taneously  utilize  the  time  series  and  cross-correlational  properties  of 
more  than  one  series  for  the  purpose  of  forecasting  EPS. 

General  Hypotheses 

As  pointed  out  above,  ARIMA  models  ignore  potential  information  in 
the  internal-external  data  series.   Because  of  this  omission  it  is  hy- 
pothesized that  AK.IMA  forecasts  will  be  improyed  by  adding  additional 
data  from  the  internal-external  data  sets.  This  leads  to  the  following 
null  and  alternative  hypotheses. 

Hypothesis 

Bo:  The  average  absolute  percent  forecast  error  for  the  trans- 
fer function  model  utilizing  earnings,  internal  and  external 
data  is  equal  to  the  average  absolute  percent  forecast  error 
for  the  ARIMA  model  utilizing  earnings  data  alone. 

Ha:  The  average  absolute  percent  forecast  error  for  the  trans- 
fer function  model  utilizing  earnings,  internal  and  external 
data  is  less  than  the  average  absolute  percent  forecast  error 
for  the  ARIMA  model  utilizing  earnings  alone. 

Absolute  percentage  error  was  selected  because  it  is  a  measure  that 
establishes  a  comparable  relationship  between  firms  that  b-ave  earnings 
which  are  different  in  absolute  scale.  For  example,,  a  fifty  cents  per 
share  forecast  error  for  a  firm  chat  reports  earnings  of  twenty-five 
cents  per  share  may  be  quite  different  in  significance  than  a  fifty  cents 
per  share  error  for  a  firm  that  reports  earnings  of  five  dollars  per 
share.  In  the  former  case  the  error  is  200  percent  and  the  latter  case 
the  error  is  only  10  percent. 

The  basic  thrust  of  Ha  is  to  assert  that  the  IF  model  will  produce 
more  accurate  forecasts  than  the  ARIMA  model.  Since  the  basic  difference 
between  the  two  is  that  TF  models  more  potential  information,  namely 


-7~ 

internal  and  external  data,  as  opposed  to  earnings  alone,  the  test  in- 
dicates whether  or  not  the  internal  and  external  data  sets  contain  any 
information  as  determined  by  a  lower  forecast  error.  Specifically,  re- 
jection of  the  null  hypothesis  would  imply  that  the  internal-external 
data  set  has  information  content  in  that  it  can  result  in  improved  fore- 
casts when  added  to  the  ARIMA  model.   It  also  would  imply  that  forecasts 
based  on  earnings  alone  may  not  be  optinai  since  they  can  be  improved 
by  adding  more  data  to  the  forecast  model.  The  question  of  optimality, 
however,  must  be  decided  on  whether  or  not  the  improvement  justifies  the 
additional  costs  that  might  be  associated  with  the  acquisition  and  usage 
of  the  additional  data, 

ilesearch  Method 


Sample 

A  sample  of  30  airline  firms  was  selected.   This  industry  was  chosen 
because  the  Civil  Aeronautics  Board  requires  all  certified  air  carriers 
to  file  quarterly  income  statements  and  balance  sheets.  The  fact  that 
balance  sheets  are  reported  is  important  because  the  statistical  analysis 
requires  the  availability  of  data  internal  for  a  firm  for  a  number  of 
accounting  periods . 

The  basic  requirement  for  a  firm  to  be  selected  was  the  availability 
of  income  statement  and  balance  sheets  for  60  quarters.  This  provided 
50  quarters  recommended  for  model  estimation  and  10  quarters  for  forecast 
error  computation.   Since  only  30  firms  in  the  industry  met  the  selection 
criteria,  the  sample  was  not  random.  A  list  of  the  sample  firms  is  pre- 
sented in  Appendix  1. 
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Operational  Definitions 

(a)  "Internal  data":  Tnis  data  set  was  constructed  from  ratios 
and  aggregate  data.   The  ratios  used  were  based  on  those 
compiled  by  Horrigan  (1965)  from  a  large  number  of  sources 
dealing  with  firiancial  statement  analysis  In  various  con- 
texts. The  ratios  were  compiled  to  be  representative  of 
those  used  in  practice.   The  following  is  a  list  of  these 
ratios: 

Short  Term  Liquidity  Ratios 

(a)  Current  asset  to  current  debt 

(b)  Kotes  receivable  plus  cash  to  current  liabilities 

Long  Terra  Solvency  S^atios 

(a)  Net  worth  to  total  debt 

(b)  Net  worth  to  long-term  debt 

Cc)  Net  worth  to  operating  plus  non-operational  property 

Capital  Turnover  Ratios 

(a)  Total  operating  revenue  to  notes  receivable 

(b)  Total  operating  revenue  to  working  capital 

(c)  Total  operating  revenue  to  operating  plus  non- 
operating  property 

(d)  Total  operating  revenue  to  net  worth 

(e)  Total  operating  revenue  to  total  assets 

Profit  Margins  Ratio 

(a)  Net  operating  profit  to  total  operating  revenues 

(b)  Net  income  to  sales 

Return  on  Investment  Ratios 

(a)  Net  operating  profits  to  total  assets 

(b)  Net  profits  to  net  worth 

In  addition  to  the  above  "internal  data"  also  included  working 
capital,  operating  income  and  total  operating  expenses. 

(b)  "External  data":   This  set  included  the  Dow  Jones  Indus- 
trial Index,  Standard  and  poors  Industry  Index  and 

the  firms'  past  stock  price  series.   These  were  chosen 
to  be  representative  of  industry  and  market  factors. 

(c)  "Earnings  data":   This  data  set  will  Include  the  primary 
earnings  per  share  of  cotnmon  before  extraordinary  items 
adjusted  for  stock  splits  and  stock  dividends. 


Construction  and  Application  of  the  Forectast  Models 
Rationale  and  Summary  of  the  Methodology 

The  null  hypothesis  was  tested  by  comparing  the  forecast  errors  of 
a  special  case  of  the  TF  model  to  the  AE.IMA  model  errors.  In  particular, 
this  method  of  testing  was  employed  because  only  a  restricted  form  of 
the  transfer  function  has  been  developed  for  general  use.   This  form  is 
called  the  single  input  transfer  function  and  is  represented  mathematically 
by  the  equation:   (2)  y^  =  [f(y.  ,,  y.  „,  ...),  f(x  .  x   ,,...),  u(t)] 
which  is  the  special  case  of  the  general  form  of  transfer  function  dis- 
cussed above  (henceforth  the  term  transfer  function  will  refer  to  the 
single  input  model). 

In  (2)  the  predicted  variable  y  is  expressed  as  a  function  of  the 
two  variables  in  brackets  on  the  right  side  plus  u(t)  which  is  an  error 
term.  This  relationship  is  graphically  depicted  in  Table.  3  and  differs 
from  the  traditional  Box-Jenkins  analysis  in  that  an  additional  variable 
X  has  been  added. 

Table  3 
Graphical  Depiction  of  the  Single  Input  Transfer  Function 


/•   jj    s    J,        transfer  !,         ,~,^„^ 
input  (predictor)  series    A     .        ^,                output  (t?S)  series^ 
e ij; -u — i:± ^  t  unction   — -^ ^^ — ■ — J^ 


The  result  is  that  equation  (2)  generalizes  the  ARIMA  class  of  models 
to  two  variables  al3.owing  a  data  series  containing  potential  information 
to  be  added,  thus  allowing  a  test  of  the  -.lull  hypothesis.  Theoretically 
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this  allows  the  null  hypothesis  to  be  tested  by  comparing  the  univariate 
forecast  errors  to  those  of  transfer  functions  constructed  by  adding  each 
of  the  variables  in  the  internal-external  data  sets,  one  by  one  to  the 
corresponding  AS.rt-lA  models.   For  example,  for  the  first  sample  firm  we 
could  compare  the  ARIMA  forecasts  to  traasfer  function  forecasts  with  the 
first  ratio  added.  We  would  contine  to  io  this  for  all  items  in  the 
internal-external  data  sets.  Finally  we  would  repeat  the  procedure  for 
the  remaining  29  firms.  Such  an  axialysis  would  work  well  except  for 
three  problems:   (1)  The  individtial  comp.arisons  would  not  be  independent 
of  each  other  and  therefore  interpretation  of  the  results  would  be  con- 
founded.  For  example,  from  a  statistical  standpoint  it  would  be 
inappropriate  to  use  the  Student's  T  test  which  requires  that  Individual 
tests  be  independent  of  each  other.  In  addition  we  would  expect  a  large 
number  of  the  tests  to  be  significant  due  strictly  to  chance.   For  ex- 
ample, for  600  tests  v/ith  each  test  having  an  a   level  of  .1  we  would 
expect  rejections  for  60  (i.e.,  600  x  .Ij  of  these  tests  due  to  chance 
alone.   For  example,  if  we  ignored  this  problem  of  independence  (which 
would  be  highly  improper)  and  found  signj.flcance  in  80  of  the  tests  we 
would  not  want  to  point  to  any  particulai'  test  and  state  that  for  that 
test  a  true  difference  in  the  population  exists.   ITiis  is  because  there 
Is  no  way  to  determine  whether  rejection  occurred  by  chance  (a  error) 
or  was  correct.  These  problems,  depoaderce  between  coaparisons  (statis- 
tical independence)  and  combined  a  errors,  will  be  dealt  with  in  detail 
below  by  using  a  MANOVA  design.   (2)  In  addition  to  the  above,  if  one 
were  to  develop  an  experimental  design  to  overcome  the  problems  of 
statistical  independence  and  combined  a  error,  interpretation  problems 
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of  multicollipearity  among  predictor  variables  would  still  remain.   For 
instance,  assume  that  we  were  able  to  state  that  in  general  the  addition 
of  the  ratio  A  reduces  forecast  error  by  20  percent  and  the  addition  of 
ratio  B  reduces  forecast  error  by  25  percent.  Also  assume  that  the  two 
ratios  are  collinear  and  that  we  are  interested  in  considering  the  im- 
provement in  forecasts  with  both  ratio  A  and  ratio  B  added  at  the  same 
tliQe.  Note  that,  due  to  the  collinearity  of  the  two  ratios,  we  could 
not  say  that  the  combined  improvement  would  be  the  sum  of  the  two  indi- 
vidual improvements.   The  problem  would  become  even  more  complex  with 
three  or  more  ratios  being  added.  Therefore  such  an  approach  would  not 
give  a  measure  of  the  value  of  a  data  set  composed  of  a  group  of  variables 
nor  would  it  enable  one  to  compare  the  predictive  value  of  one  group  of 
variables  to  another.  In  particular,  since  the  internal-external  data 
set  is  a  group  of  variables  (in  the  event  that  it  has  predictive  value) 
it  would  not  be  possible  to  measure  the  predictive  value.   (3)  The 
analysis  would  require  600  transfer  functions  and  30  ARIMA  models,  and 
due  to  these  large  numbers  the  study  would  not  be  feasible  due  to  the 
difficulty  of  processing  such  a  large  number  of  models  (Interactive 
modeling  can  require  as  much  as  30  minutes  connect  time  per  model). 

Due  to  the  problems  of  ffiulticollinearity  between  predictor  variables 
and  large  number  of  models  factor  analysis  with  orthogonal  rotation  was 
used.  Factor  analysis  is  employed  because  it  has  two  useful  properties 
that  overcame  both  problems*   In  particular,  factor  analysis  with  ortho- 
gonal rotation  is  a  statistical  method  which  reduces  a  group  of  variables 
into  a  smaller  number  of  variables  which  are  not  multicollinear  while  at 
the  same  time  retaining  substantially  all  information  (statistical  vari- 
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ation)  in  the  original  group  of  variables  (Morrison,  1967).  Tlxe   net 

result  is  a  small  nmaber  of  non-muiticollinear  variables." 

The  following  is  a  four-step  smmnary  of  the  method  used  to  test  the 

null  hypothesis: 

Step  1;  rne  internal  and  external  data  sets  are  pooled  and  jointly 
factor  analyzed  (using  orthogonal  rotation)  into  uncorre- 
lated  components.   The  resultant  factor  loadings  are  used 
to  construct  indices  (for  a  similar  use  of  factor  analysis, 
see  King,  1966). 

Step  2:   (A)  The  above  indices  are  used  as  input  series  into  linear 

transfer  function  models.  The  single  input  transfer  function 
models  are  identified  and  estimated  for  each  firm  and  input 
series  using  approxiaiateiy  50  quarters  of  data  (the  most 
recent  10  quarters  are  excluded  from  this  step  for  purposes 
of  testing  the  null  hypothesis). 

(B)  The  single  input  transfer  functions  are  applied  to  fore- 
casting the  10  quarters  omitted  in  (A)  above. 

Step  3;  ARIMA  models  for  adjusted  EPS  are  constructed  and  applied  to 
forecasting  in  Step  2. 

Step  A;  For  each  of  the  10  forecasts  mean  absolute  percentage  forecast 

etrors  are  computed  by  factor  index  as  well  as  for  the  ARIMA 

7 
forecasts.   Finally ^  means  for  the  ARBIA  absolute  percentage 

errors  are  compared  to  the  same  transfer  function  means. 

Detailed  Discussion  of  the  Method 

Step  1  (factor  analysis);   Tlie  internal  and  external  data  sets  are  pooled 
into  a  common  data  set  and  then  factored  firm  by  firm  cross- 
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sectionaliy  into  distinct  uncorrelated  coiGponents.   The 
resultant  factor  loadings  are  then  applied  to  the  data  to 
produce  firm  by  firm  indices. 
Each  firm  behaved  approximately  the  same  under  the  factor  analysis; 
typicallyj  four  factors  explained  approximately  80  percent  of  the  vari- 
ation in  the  data  relatijig  to  each  firm.   In  addition,  a  fifth  factor, 
if  computed,  typically  explained  a  very  small  proportion  of  the  total 
variation  (i.e.,  approximately  5  percent).   For  this  reason  four  factors 
were  retained  for  each  firm.   Table  4  presents  a  firm  by  firm  analysis 
of  the  percent  variation  exrplained  for  each  of  the  four  factors.   For 
example,  the  first  factor  for  firm  1  explained  24.7  percent  of  the  vari- 
ation in  firm  1,  factor  2  explained  19.3  percent  of  the  variation  in 
firm  1  and  the  four  factors  combined  explained  a  total  of  72.1  percent 
of  the  variation  of  the  firm  1  data  set. 

These  percentages  can  be  interpreted  in  tercis  of  the  relative  im- 
portance of  their  associated  factors  with  respect  to  information  con- 
teat.   It  should  be  noted;  however,  that  this  information  content  is 
determined  with  respect  to  predicting  the  values  of  the  variables  in 
the  original  unfactored  data  set  and  not  earnings  per  share.   In  addi- 
tion, it  is  possible  to  interpret  the  factors  conceptually  in  terms  of 
the  original  variables.   Such  an  analysis,  hovever,  will  not  be  under- 
taken in  this  research  since  the  research  design  was  oriented  towards 
the  specific  goals  of  data  reduction  and  orthogonaliza tion  rather  than 
interpretation.   (For  a  detailed  interpretive  factor  analysis  see  Pinces 
et  al.,  1975.) 
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Table  4 
Percent  Variation  Explained  for  Each  Firm  by  Factor 


Firm  No, 

Factor  1 

Factor  2 

Factor  3 

Factor  4 

Firm  Total 

1 

24.7 

19.3 

14.9 

13.2 

72.1 

2 

33.2 

21.5 

14.7 

8.4 

77.8 

3 

28.3 

27.5 

13.5 

8.3 

77.7 

4 

35,5 

25.7 

16.2 

6.8 

84.2 

5 

45.6 

20.3 

7,9 

6.6 

80,4 

6 

72.3 

9.9 

6.8 

4.3 

93.4 

7 

54.0 

14.2 

10.6 

7.6 

86.4 

8 

45.8 

20.6 

11.3 

6.4 

84.2 

9 

31.0 

22.0 

18.7 

11.2 

82.8 

10 

30.9 

26.4 

15.7 

9.5 

82.4 

11 

33.9 

19.0 

15.6 

11.9 

80.4 

12 

60.1 

14.2 

9.8 

4.7 

88.7 

13 

32,0 

18.3 

14.7 

11.3 

76.3 

14 

34.7 

22.8 

18,2 

8.7 

84.4 

IS 

37.2 

18.2 

11. 0 

8,7 

75.2 

16 

34.6 

25.2 

12.6 

11,7 

84.0 

17 

-   35.8 

24-7 

17.5 

5-2 

82.6 

18 

32.7 

20.7 

13.7 

10.3 

77.4 

19 

42.8 

18.0 

12.1 

11.3 

84.3 

20 

36.7 

26.8 

17.3 

5.3 

86.1 

21 

32.5 

30.1 

10.2 

7.1 

79.9 

22 

31.1 

19.2 

10.4 

10.3 

70.9 

23 

42.5 

26,2 

11.9 

5.3 

85.9 

24 

30.5 

19.5 

15.4 

10.0 

75.5 

25 

41.5 

20.6 

13.6 

7.5 

83,3 

26 

33.2 

27,6 

14.0 

10.9 

85.7 

27 

36.6 

24.9 

15,8 

6,8 

84.0 

28 

38.5 

19,5 

15.0 

8.5 

81.5 

29 

47.0 

18.3 

17.0 

5,2 

87.4 

30 

32.9 

18.8 

17.6 

11,5 

80.8 
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Steps  2  and  3  (modeling):   For  each  firm  four  transfer  and  one  univariate 
ARIMA.  models  are  constructed.   Each  of  the  four  transfer  models 
correspond  to  one  of  the  four  factor  indices  constructed  above. 
The  total  number  of  models  constructed  is  therefore  120  (30 
firms  with  four  models  each)  for  the  transfer  case  and  30  (one 
for  each  firm)  for  the  univariate  case  resulting  in  a  total  of 
150  models.  Next,  10  forecasts  for  each  model  are  then  gen- 
erated producirig  1,500  forecasts. 

9 

An  example  of  this  procedure  is  given  in  the  case  of  firm  1."   In 

this  case  four  transfer  functions  and  one  ARIMA  model  are  estimated.   In 
each  case  a  base  period  of  50  quarters  is  used  which  Includes  quarters 
beginning  at  the  first  quarter  of  1962  and  ending  with  the  fourth  quarter 
of  1974  (see  Appendix  2  for  a  complete  listing  of  base  periods  and  fore- 
cast horizon).   The  next  step  is  to  compute  forecasts  for  10  quarters 
ahead.  The  results  in  10  forecasts  for  each  single  input  transfer  func- 
tion and  10  forecasts  for  the  ARIMA  model,  giving  a  total  of  50  forecasts 
for  firm  1. 

Step  4  (Test  of  the  null  hypothesis);  A  test  of  null  hypothesis  was  based 
on  a  MANOVA  (multivariate  analysis  of  variance)  design  which 
is  a  simple  generalization  of  ANOVA  (analysis  of  variance). 
Tkie   difference  between  the  two  methods  is  that  ANOVA  tests 
for  differences  between  laeans  for  a  single  variable  whereas 
MANOVA  tests  for  differences  between  means  for  a  group  (i.e., 
vector)  of  variables.   In  particular  Ai^OVA  testa  the  hypothesis: 

U,  ,  ^  b"  -  =  ...  =  U  ,  and  liMOK^A   tests  the  hypothesis: 
1,1    2,1         aiji. 
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"^  1 
1,2 


l"i,pJ 


u; 


2,1 
2,2 


l"2.pj 


^     1 
in,2 


U 
m,p 


where  U,  ,  represents  the  mean  of  variable  j  in  group  i. 

Note  that  if  p  equals  one  then  the  MANOVA  hypothesis  reduces 

to  the  AMOVA  hypothesis. 
The  assumptions  of  MANOVA  are  simple  multivariate  generalizations 
of  the  ANOVA  assumptions  of  normality,  and  homogeneity  of  variances. 
The  first  assumption  (multivariate  normality)  generally  is  considered 
satisfied  for  reasonably  large  samples  by  the  multivariate  central  limit 
theorem  (Morrison,  pp.  80-81),   The  second  assumption  (multivariate 
homogeneity  of  variances)  has  been  found  to  be  robust  in  the  cases  studied 
(Harris,  p.  85;  Morrison,  p.  152). 

MANOVA  can  be  applied  to  the  problem  at  hand  by  letting  the  mean  ab- 
solute percentage  forecast  error  associated  with  each  of  the  four  factor 
indices  by  symbolized  as  forctype.  (i  =  1,2,3,4;  forctj^e  is  defined  as 
forecast  type).   In  addition,  the  mean  absolute  percentage  errors  asso- 
ciated with  the  univariate  series  is  symbolized  as  forctype  .   Finally, 
the  means  are  broken  down  according  to  the  number  of  steps  ahead  (i.e., 
periods  in  the  future)  sjrmbolized  as  step,  (j  =  1,2,...,  10).   To  sisn- 
marize  U.   represents  the  mean  absolute  percentage  error  for  forctype. 
and  step  .   This  design  is  graphically  presented  in  Table  A. 

In  terms  of  the  MANOVA  design  the  null  hypothesis  can  be  stated  as: 
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U 


1,1 

;;i,2 

1   3 

ill 


u 


1,10 


With  the  alternative  being : 


P2 11 

u  » 
^2,2 

^2,3 

« 
• 

^.'lo 

5= 

^'3    1" 

"32 
U    ' 

"3,4 
^3!  10 

sa 

4,4 

• 
♦ 

iS 

• 
• 

^siio 

a1 


"1,4 


&,lQj 


U 


1 

2 
2,3 

2,4 


b2;io 


3  1 

'3  2 
J   * 

3,3 

3,4 


^3,i0j 


4  2 
U   ' 

4  3 
U   * 

4,4 


l!^4,10j 


5,1 
5,2 

5,3 
5,4 


l"5,10 


Note  that  the  null  hypothesis  states  that  there  is  no  difference  be- 
tween any  of  the  five  rows  in  Table  5.  Specifically,  row  5  represents  the 
ARIMA.  forecasts  and  the  first  four  rows  represent  transfer  function  fore- 
casts.  In  addition  the  MAUOVA  test  is  very  powerful  (Cooley  and  Lohnes, 
1971,  p»  228)  and  can  be  expected  to  lead  to  rejection  of  the  null  hy- 
pothesis in  the  event  that  any  two  rows  differ.  In  particular,  if  one 
of  the  first  four  rows  (transfer  functions)  differs  from  the  fifth  row 
(ARIMA  models)  we  would  expect  a  rejection  of  the  null  hypothesis. 

Given  the  above  it  is  now  possible  to  discuss  reasons  for  the  selec- 
tion of  the  MANOVA  design.   First  recall  from  the  above  discussion  on 
the  rationale  for  the  method  that  the  two  potential  problems  of  combined 
a  error  and  statistical  independence  exist  when  making  statistical  com- 
parisons.  This  can  be  seen  in  terms  of  the  above  design  between  rows  in 
each  of  the  ten  columns  corresponding  to  the  10  steps  ahead.  One  could 
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Table  5 


MANOVA  Design  for  Mean  Absolute 
Percentages  Forecast  Errors 


F 
0 
R 
E 
C 
A 
S 
T 

T 
Y 
P 

E 

i 

STEP 

1 

2 

3 

4 

5 

6 

"^       7 

8 

9 

10 

\i 

"l,2 

"1.3 

"1.4 

"1.5 

"1.6 

"1.7 

"1.8 

"1.9 

"l.lO 

2 

\i 

"2,2 

"2.3 

"2.4 

"2,5 

"2.6 

"2.7 

"2,8 

"2,9 

"2.10 

3 

"3.1 

"3,2 

"3,3 

"3,4 

"3,5 

"3,6 

"3.7 

"3,8 

"3,9 

"3.10 

4 

Xl 

\2 

"a,3 

"4.4 

"4.5 

"4.6 

"4.7 

"4.8 

"4.9 

"4.10 

5 

"5,1 

"5.2 

"5.3 

"5,4 

"5.5 

"5.6 

"5.7 

"5.8 

"5.9 

"5.10 

Note  that  there  are  50  cells  with  30  observations  in  each  cell 
accounting  for  the  1,500  forecasts. 
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also  generate  (?)  or  1,225  different  t  tests  by  taking  all  possible  com- 
binations of  cells.  Of  Importance  is  the  fact  that  such  tests  would  not 
be  independeint  nor  would  they  guard  against  combined  alpha  errors,  and 
therefore  they  would  be  highly  inappropriate.  For  this  reason  a  MANOVA 
design  was  chosen  because  it  is  not  subject  to  these  two  problems.   (For 
a  more  complete  discussion  see  Morrison,  1967,  pp.  126-127  and  Finn,  1974, 
p.  320.)  In  addition,  a  second  reason  is,  as  cited  above,  that  MANOVA 
is  very  powerful  and  therefore  is  very  lilcely  to  detect  any  true  popula- 
tion differences. 

Discussion  of  the  Results  of  the  MANOVA 

The  results  for  the  MANOVA  are  presented  below.  The  dotted  lines 

10 
separate  various  methods  of  comparable  te:5ts  of  the  null  hypothesis. 

Wilk's  criterion  (often  called  Wilk's  lambda)  yields  an  F  approximation 
of  .55  with  40  and  552  degrees  of  freedom  which  is  significant  at  the 
.9893  level.  The  results  indicate  that  rtjjection  of  the  null  hypothesis 
is  not  warranted  from  the  sample  data. 

The  Implication  of  the  test  is  that  i:he  five  types  of  forecasts  are 
the  same.  Initially  it  might  appear  surprising  that  the  fourth  factor, 
which  explained  a  relatively  small  proportion  of  the  variation  in  the 
original  data,  performed  as  well  as  the  first  factor  which  explained 
a  relatively  large  proportion  of  the  variation.  This  result,  however, 
can  be  understood  in  light  of  the  nature  of  the  bivarlate  transfer  func- 
tion which  models  the  earnings  and  potent:Lal  information  as  additional 
series  (the  factor  index).  If  the  additional  series  has  no  additional 
Information  with  predictive  value,  we  would  expect  that  the  forecasts 
wotild  be  no  better  or  worse  than  without  t:he  inclusion  of  the  additional 
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varlable.  Therefore,  the  results  are  consistent  with  the  hypothesis 
that  the  factor  indices  provided  no  additional  information  in  terms  of 
forecasting  earnings  than  was  contained  in  the  earnings  series.   In 
addition  to  the  above,  due  to  the  generality  of  the  tttlJOVA  test,  the 
results  hold  across  each  step  ahead  ir.  the  forecast  horizon  taken  by 
itself. 

General  Conclusion  and  Implications 

In  light  of  the  results  of  the  miitivariate  analysis  of  variance 
summarized  above,  the  conclusion  of  this  study  is  that  contposite  uncor- 
related  indices  computed  from  ratio,  iiarket  and  industry  data  provide 
no  additional  information  in  forecast:Lng  earnings  per  share  than  is 
already  contained  In  the  earnings  series  itself.  The  results  of  the 
study  tend  to  shed  doubt  on  the  hypothesis  that  MIMA  forecasts  can  be 
improved  by  considering  additional  data  in  the  internal  and  external  data 
sets. 

Limitations  and  Suggestions  for  Futura  Research 

The  study  is  limited  in  that  it  was   restricted  to  a  particular  in- 
dustry and  to  a  particular  set  of  predictor  variables.  One  line  of  future 
research  might  be  to  remove  these  restrictions.  In  addition,  as  with  all 
statistical  modeling  studies,  the  research  is  subject  to  the  models  and 
modeling  procedures  used. 

Another  potential  line  of  future  research  might  be  to  explore  an 
Industry  or  market  wide  transfer  function.  This  could  be  done  by  finding 
one  basic  structural  model  form  that  can  be  used  for  all  firms  in  a  given 
market  or  industry.  This  approach  htis  met  with  success  in  the  univariate 
case  (Brown  and  Rozeff,  1977). 
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Notes 

Information  is  asi.iinied  to  be  data  with  predictive  value. 

2 

Net  worth  to  long  term  debt  could  not  be  computed  for  firms  7  and 

21  due  to  the  nonexistence  of  long  tens  debt  for  a  substantial  number  of 
quarters. 

3 

Both  the  Dew  and  industry  irsdices  were  computed  from  averaging 

quarterly  data  taken  from  Security  Owners  Stock  Guide  (Standard  and  Poors' 
corporation) . 

A 

The  stock  prices  were  adjusted  for  splits  and  dividends.  Also 
for  firms  1,  3,  5,  7,  15,  16,  17,  21,  22,  24  and  29,  a  complete  series 
of  stock  prices  were  not  available. 

EPS  were  taken  from  Moody's  Handbook  and  adjusted  for  stock 
splits  and  stock  dividends.   In  addition  for  firmt-  1,  2,  3,  5,  7,  12,  14, 
15,  16,  18,  20,  21,  22,  24,  25  and  29,  EPS  were  computed  using  informa- 
tion from  schedule  B~3  at  the  Civil  Aeronautics  Board  (CAB)  from  41  in 
conjunction  with  the  CAB  quarterly  periodical  Air  Carrier  Financial 
Statistics. 

c 

Note  that  the  or thogonalization  is  cross-sectional  and  therefore 

lagged  cross-correlations  can  exist.   This  means  that  the  information 
content  of  the  factors  is,  in  general,  not  strictly  additive. 

Absolute  percentage  error  is  computed  by  the  formula: 


ictual  EPS  -  forecasted  EPS 
8 


^ctual  EPS  -  forecasted  EPS. 
^      act!ial  EPS         ^ 


The  modeling  was  done  using  a  program  first  written  by  David 
Pack  of  the  Ohio  State  University  and  modified  for  local  use  at  the 
University  of  Illinois  by  James  HcKeown.   In  condensed  form  the  models 
occupy  20  pages  and  thus  are  not  presented  in  this  study;  however  they 
will  be  furnished  upon  \«ritten  request  to  the  author. 

In  a  relatively  small  number  of  cases  there  was  missing  data  due 
to  missing  actual  EPS  and  cases  where  the  absolute  forecast  error  cannot 
be  computed  due  to  zeros  la  the  denominators.   In  such  cases  observa- 
tions were  replaced  by  their  cell  means.  A  listing  of  these  cases  is 
presented  in  Appendix  2. 

This  criterion  alone  is  discussed  because  it  is  probably  the  most 
common  and  well  known.   For  a  discussion  of  the  various  MAJ?OVA  statistics 
see  Morrison  (1967).  Note  that  all  of  the  tests  presented  resulted  in  ap- 
proximately the  same  significance  level. 


APPENDIX  1 
DESCRIPTION  OF  THE  SAMPLE  EMPLOYED 

The  constraining  factor  in  sample  selection  was  meeting  the  above 
mentioned  information  requirements.   P^ndom  sampling  was  not  possible 
because  only  30  firms  met  the  information  requirements  of  60  quarters 

of  income  statement  and  balance  sheet  information.   Below  is  a  list  of 
these  firms. 

1.  Airlift  International 

2.  Alaska  Airlines 

3.  Aloha  Airlines 

4.  American  Airlines 

5.  Aspen  Airways 

6.  Branlff  Airways 

7.  Caribbean  Atlantic  Airlines 

8.  Continental  Airlines 

9.  Delta  Airlines 

10.  Eastern  Airlines 

11.  Tiger  International  Airlines 

12.  Frontier  Airlines 

13.  Hawaiian  Airlines 

14.  National  Airlines 

15.  New  York  Airways 

16.  North  Central  Airlines 

17.  North  West  Airlines 

18.  Ozark  Airlines 

19.  Pan  American  Airways 

20.  Piedmont  Airlines 

21.  Reeve  Airlines 

22.  SFO  Airlines 

23.  Seaboard  World  Airlines 

24.  Southern  Airways 

25.  Texas  International  Airlines 

26.  Trans  World  Airlines 

27.  UAL  (United  Airlines) 

28.  Western  Airlines 

29.  Wien  Airlines 

30.  Alleghenay  Airlines 

Each  firm  will  be  subsequently  referred  to  by  the  identifying  number  that 
precedes  It. 


APPEN^DIX  2 

DESCRIPTION  OF  AVAILABLE  DATA  FOR 
FORECAST  ERROR  ANALYSIS 


This  appendix  gives  a  firm  by  firm  description  of  the  number  of 
quarters  of  data  avilable  for  forecast  error  analysis.   For  each  firm 
the  number  of  periods  in  the  base  period,  the  origin  date  for  forecast- 
ing, and  the  number  of  absolute  percentage  forecast  errors  is  presented. 

nurcber  of  steps  ahead 
for  which  absolute 
percentage  forecast 
error  was  computed 

10 
10 
10 
10 
10 
8 
9 
7 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
10 
2 
10 


firm 

number  of  periods 

origin  date  for 

number 

in  base  period 

forecasting 

1 

50 

4/74 

2 

49 

4/74 

3 

50 

4/74 

4 

50 

4/74 

5 

30 

4/74 

6 

30 

3/74 

7 

40 

4/71 

8 

50 

4/74 

9 

50 

4/74 

10 

50 

4/74 

11 

50 

4/74 

12 

50 

4/74 

13 

49 

4/74 

14 

50 

4/74 

15 

50 

4/74 

16 

50 

4/74 

17 

50 

4/74 

18 

50 

4/74 

19 

50 

4/74 

20 

50 

4/74 

21 

50 

4/74 

22 

42 

4/74 

23 

50 

4/74 

24 

50 

4/74 

25 

50 

4/74 

26 

50 

4/74 

27 

50 

4/74 

28 

50 

4/74 

29 

30 

2/76 

30 

50 

4/74 

M/B/87 
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